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Abstract —How would a cellular network designed for maximal 
energy efficiency look like? To answer this fundamental question, 
tools from stochastic geometry are used in this paper to model 
future cellular networks and obtain a new lower bound on the 
average uplink spectral efficiency. This enables us to formulate a 
tractable uplink energy efficiency (EE) maximization problem 
and solve it analytically with respect to the density of base 
stations (BSs), the transmit power levels, the number of BS 
antennas and users per cell, and the pilot reuse factor. The closed- 
form expressions obtained from this general EE maximization 
framework provide valuable insights on the interplay between 
the optimization variables, hardware characteristics, and prop¬ 
agation environment. Small cells are proved to give high EE, 
but the EE improvement saturates quickly with the BS density. 
Interestingly, the maximal EE is achieved by also equipping the 
BSs with multiple antennas and operate in a “massive MIMO” 
fashion, where the array gain from coherent detection mitigates 
interference and the multiplexing of many users reduces the 
energy cost per user. 


I. Introduction 

The biggest challenges for next generation wireless commu¬ 
nication systems (5G) are to support the ever-growing demands 
for higher data rates and to ensure a consistent quality of 
service (QoS) throughout the entire network To meet 
these demands, the network area throughput (in bit/s/km 2 ) 
needs to increase by a factor of 1000 over the next 10- 
15 years |4). At the same time, the power consumption of 
the information and communication technology (ICT) industry 
and the corresponding energy-related pollution are becoming 
major societal and economical concerns j5j. Credited sources 
foresee that, to meet such a 1000 x higher data traffic without 
increasing the ICT footprint, new technologies that improve 
the overall energy efficiency (EE) by 1000 x must be devel¬ 
oped |6j] . Hence, higher network area throughput on the one 
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hand and less power consumption on the other are seemingly 
contradictory 5G requirements 0. There is a broad consensus 
that these goals can only be jointly achieved by a substantial 
network densification. 

Two promising technologies towards network densification 
are small-cell networks 0- 0 and massive MIMO systems 
Q3-Q3- The first technology relies on an ultra-dense and 
irregular operator-deployment of low-cost and low-power base 
stations (BSs), with higher density where the user load is 
higher. Bringing the BSs and user equipments (UEs) closer to 
each other can increase the area throughput, while significantly 
reducing the radiated signal power. However, this may come 
at the price of a substantial increase of the circuit power con¬ 
sumption (per km 2 ) due to the larger amount of hardware and 
infrastructure G3- In contrast, the massive MIMO technology 
aims at evolving the conventional BSs by using arrays with 
a hundred or more small dipole antennas. This allows for 
coherent multi-user MIMO transmission where tens of users 
can be multiplexed in both the uplink (UL) and the downlink 
(DL) of each cell. It is worth observing that, contrary to what 
the name “massive” suggests, massive MIMO arrays are rather 
compact; 160 dual-polarized antennas at 3.7 GHz fit into the 
form factor of a flat-screen television 03 - In massive MIMO 
systems, the area throughput is improved by the multiplexing 
gain, while the array gain from coherent processing allows 
for major reductions in the emitted power. Similar to small¬ 
cell networks, however, the potential throughput gains from 
massive MIMO come from deploying more hardware (i.e., 
multiple antenna branches per BS), which in turn increases 
the circuit power consumption per BS. 

In summary, both densification technologies can improve 
the area throughput and reduce the radiated power, but at 
the cost of deploying more hardware infrastructure. Hence, 
the overall EE of the network can only be improved if these 
benefits and costs are properly balanced. The main objective of 
this paper is to develop an analytical framework for designing 
dense cellular networks for maximal EE, as well as to provide 
guidelines for practical deployment when (among others) the 
spectral efficiency (SE), radiated power, BS density, number of 
antennas and UEs per BS, channel estimation, and the circuit 
power consumption are taken into account from the outset. 

A. Related Works 

The EE of cellular networks has been defined from different 
perspectives in the last decade (B), (T§. One of the most 
common definitions is a benefit-cost ratio, where the service 
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quality per area unit (a.u.) is compared with the associated 
energy costs. In this paper, the following general definition is 
used: 


EE = 


Area spectral efficiency [bit/symbol/km 2 ] 

Transmit power + Circuit power per a.u. [J/symbol/km 2 ] 

which yields to an EE metric measured in bit/jQ 

Most of the early works that analyzed the EE, as defined 
above, have focused on the single-cell case wherein the 
interference from other cells is neglected. Examples in this 
context are & G3-E3- In particular, in (17) the authors 
consider the UL power allocation of multi-user MIMO systems 
and show that the EE is maximized when specific UEs are 
switched off. The UL was studied also in (18) , where the EE 
was shown to be a concave function of the number of BS 
antennas, M, and the UE rates. The DL has been investigated 
in (13), 1191. [20], whereof (13) and (T9) show that the EE is a 
concave function of M, while a similar result is shown for the 
number of UEs, I \, in |20j. Unfortunately, in these works, the 
system parameters were optimized by means of simulations 
that (although useful) do not provide a complete picture of 
how the EE is affected by M and K. Recently, ED derived 
the optimal M and K for a given UL sum rate, while [23] 
derived the optimal M, K and UE rates to maximize the EE 
jointly for the UL and DL. These two works utilize a realistic 
power consumption model where the dissipation in circuits 
depends non-linearly on the above parameters. Simple closed- 
form expressions for the EE-maximizing parameter values 
and their scaling behaviors are derived for zero-forcing (ZF) 
processing with perfect channel state information (CSI) and 
verified by simulations for other processing schemes under 
imperfect CSI. 

The EE analysis of multi-cellular networks is much more 
involved than in the single-cell case due to the complicated 
network topology and the arising inter-cell interference. The 
simplest approach is to rely on heavy Monte Carlo simula¬ 
tions. Attempts in this direction can be found in J24) and 
[25). Unfortunately, Monte Carlo simulated results are often 
anecdotal since one cannot separate fundamental properties 
from behaviors induced by parameter selection. Alternatively, 
simplified network topologies can be considered, such as the 
Wyner model (26) or the symmetric grid-based deployment 
J23) . While attractive for its analytical simplicity, the Wyner 
model does not well capture the essential characteristics of 
real and practical networks ED- Similarly, the symmetric grid- 
based models cannot capture the irregular structure of small¬ 
cell network deployments. 

The need for developing tractable (yet reasonably accurate) 
models for future dense networks has increased the interest 
in random spatial models; more particularly, in using tools 
from stochastic geometry [28) , wherein the BS locations form 
a realization of a spatial point process—typically a Poisson 


'Some prior works have considered erroneous EE metrics in bit/J/Hz, by 
forgetting to scale both the numerator and the denominator with the symbol 
rate (or the bandwidth) that was explicitly (or implicitly) assumed in order to 
compute the noise power and the dimensionless SNR value. These works are 
not mentioned in this section for obvious reasons. 


point process (PPP). A major advantage of this approach is the 
ability of providing tractable expressions for key performance 
metrics such as the coverage probability and the average SE 
of the network. A few prior works have also derived EE- 
related performance metrics and showed how these depend 
on the BS and UE densities; for example, f29) compares the 
deployment of two types of single-antenna BSs, while (30) 
studies the EE when multi-antenna BSs serve one UE each. In 
1311, the authors focus on the analysis and design of energy- 
efficient heterogeneous cellular networks (HetNets) through 
the deployment of small cells and sleeping strategies. The 
effect of cross-tier BS cooperation and clustering on the EE 
in the DL is studied in [32) . In contrast to the UL analysis in 
our paper, the vast majority of work on the EE in multi-cell 
networks using stochastic geometry has focused on the DL. 
Furthermore, prior work usually aims at deriving closed-form 
SE and EE expressions, which often turn out to be involved 
and cumbersome to optimize, thereby providing little or no 
useful insights on the optimal system design. 


B. Major Contributions 

This paper considers the UL of a multi-cell multi-user 
MIMO network in which the BSs are distributed according 
to a homogenous PPP of intensity A. Each BS is equipped 
with M antennas and communicates with K single-antenna 
UEs uniformly distributed within its coverage area. Coherent 
detection based on maximum ratio combining (MRC) is used 
at the BSs, based on CSI acquired from UL pilot signaling 
with each pilot symbol being reused in 1//3 of the cells. 
In addition, we assume that the UEs are prone to practical 
transceiver hardware impairments and apply UL power control 
based on statistical channel inversion (with proportionality 
coefficient p), to resolve the near-far effects that otherwise 
would prevent communication [ [33) . Tools from stochastic 
geometry and classical statistics are used to compute a new 
lower bound of the average SE. This expression is then used 
to define the EE metric by also using the power consumption 
model developed in |[22), which not only accounts for the 
radiated power but also for the operating power consumption 
required by (among others) analog transceiver chains, digital 
processing, and backhaul infrastructure. 

Within the above conditions, an EE maximization problem 
is formulated under the assumption that a given average 
SE target per UE must be met with equality, to guarantee 
good service quality. This problem is solved analytically with 
respect to the tuple 6 = (/?, A, p 1 M, K) of optimization 
variables. The resulting closed-form expressions provide valu¬ 
able design insights on the interplay between the different 
system parameters and the various components of the power 
consumption model. We show analytically and numerically 
that the radiated power is negligible in most practical cases 
and that massive MIMO configured BSs appear naturally since 
this technology can protect the desired signals from inter-cell 
interference and share the circuit power between many UEs. 


C. Outline and Notation 

The remainder of this paper is organized as follows. Section 
[II] introduces the system model for the network under inves- 
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Fig. 1. Illustration of one realization of BS positions from the homogeneous 
PPP and of UEs uniformly distributed in the corresponding Poisson- 
Voronoi cells. 


tigation, including hardware impairments at the UEs and CSI 
estimation at the BSs. A new and tractable lower bound on the 
average SE is derived in Section III It is used in Section IV to 


formulate the EE maximization problem and to obtain closed- 
form expressions for the EE-maximizing pilot reuse factor, BS 
density, and transmission power as well as the number of UEs 
and antennas per BS. The theoretical analysis is corroborated 
by numerical results in Section [V] while the major conclusions 
and implications of this paper are drawn in Section [VT| 

The following notation is used throughout the paper. The 
notation E{-} indicates the expectation with respect to a 
random variable, whereas || ■ || and | • | stand for the Euclidean 
norm and absolute value, respectively. We let I m denote 
the M x M identity matrix, whereas we use to 

denote a multi-variate circularly-symmetric complex Gaussian 
distribution. We use C, Z + , and R to denote the sets of all 
complex-valued numbers, all positive integers, and all real¬ 
valued numbers, respectively. The Gamma function is denoted 
by T(-). 


II. System Model 

We consider the UL of a cellular network that is designed 
to serve a dense heterogeneous distribution of UEs. This is 
modeled using the stochastic geometry framework adopted in 
[34 1 in which the BSs are distributed spatially in R 2 according 
to a homogeneous PPP of intensity A (measured in BSs 
per km 2 ). More precisely, this means that in any area of 
size A (measured in km 2 ), the number of BSs is a Poisson 
distributed stochastic variable with mean value AA The BSs 
are uniformly and independently distributed over the area. 
Each of them is equipped with an array of M antennas and 
serves K single-antenna UEs, which are selected at random 
from a (potentially) very large set of UEs in the cell since we 
treat K as an optimization variable. We assume that each UE 
connects to its closest BS such that the coverage area of a BS is 
its Poisson-Voronoi cell; see Fig.[T]for an illustration. The UEs 
are assumed to be uniformly distributed in the Poisson-Voronoi 
cell of their serving BS. Note that the geographic locations of 
UEs and BSs are correlated under this model, such that small 
cells serve more UEs per km 2 than larger cells. We interpret 
this as a judicious network deployment where the BSs are 


matched to a heterogenous distribution of UEs j^] 

The translation invariance of PPPs allows us to perform 
statistical performance analysis for a typical UE (located for 
example in the origin), which is statistically representative for 
any other UE in the network (35 J. Assume that this typical 
UE has the arbitrary index k and is connected to a BS that is 
called the typical BS and is denoted as BSo G $>. Then, the 
following basic properties hold (see for example (36)): 

Lemma 1. The distance dookfrom the typical UE to its serving 
BS is Rayleigh distributed as doofc ~ Rayleigh (-^=). The 
BSs of the other cells form a homogenous PPP \Eu = \ 

{BS 0 } in R 2 \ {x G R 2 : ||x|| < d 00 k}- 

Using the notation introduced in Lemma |T| each interfering 
BS, BSj G v lG, is situated at the geographical position 
determined by its spatial point BSj G R 2 . Since in most cases 
we are interested only in the index of the interfering BS, the 
shorter notation j £ \Eu is used henceforth for BS ; G 

A. Channel Model and Power-Control Policy 

We consider a UL transmission protocol where the time- 
frequency resources are divided into blocks of T c seconds and 
W c Hz. This leaves room for a total number of S = T C W C 
transmission symbols for pilot signaling and data. The channel 
response h ijk G C A 1 between BS; and UE k in cell j is mod¬ 
eled as block-fading such that h ijk has a constant stochastic 
realization within a block and takes independent realizations 
across blocks. Since the number of BS antennas is treated 
as an optimization variable in this work, we need a channel 
model that is reasonable for both small and large arrays. 
For this purpose, we make use of a Rayleigh fading model 
as it well-matches non-line-of-sight measurements in both 
cases (37). Hence, we consider Rayleigh fading channels with 
h ijk ~ CJ\f(0,uj~ 1 df-^lM)^ where dijk is the propagation 
distance (measured in km) and a > 2 is the pathloss exponent. 
The parameter u> accounts for the pathloss at a reference 
distance of 1 km and can thus be used to model distance- 
independent propagation losses (such as wall penetration). 

Power control is an essential mechanism in the UL of 
any multi-user MIMO system; the finite dynamic range of 
analog-to-digital converters at the BSs otherwise creates near- 
far blockage where weak signals from distant UEs drown in 
stronger signals from nearby UEs. Statistical channel inversion 
is a convenient power-control policy (33) , where UE i in cell 
j uses the transmit power 

Pji = Pud'jji (1) 

with p > 0 being a power-control coefficient to be designed^ 
The average transmit power at UE i in cell j will then be 

ur i ur^Q i T(a/2 + l) 

E{ Pjl } = = pu ( ^ A)a/2 (2) 

2 Alternatively, the UEs could have been distributed according to an inde¬ 
pendent homogeneous PPP [30| , leading to independent BS and UE locations. 
This is less sensible since smaller cells would on average have fewer UEs 
than larger cells, which contradicts the main principle of densifying networks 
mainly at places with high user loads. 

3 We show later that the transmit power is low at EE-optimized operating 
points. To simplify the notation, we have therefore not specified any maximal 
transmit power constraints at the users. 
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by using the distance distribution from Lemma [T] A key fea¬ 
ture of the statistical power-control policy is that it provides the 
same average effective channel gain of E {pjiljhjjiH 2 } = Mp 
to all UEs (irrespective of their locations) without requiring 
any rapid feedback mechanism to provide UEs with instanta¬ 
neous CSI. Note that this does not mean that the signal-to- 
interference-plus-noise ratio (SINR) is the same for all UEs, 
since the interference varies in the network. 


B. Transceiver Hardware Impairments 


Any practical wireless transceiver is affected by hardware 
impairments; for example, unavoidable clock drifts in local os¬ 
cillators, finite-precision digital-to-analog converters, amplifier 
non-linearities, non-ideal analog filters, etc. Such impairments 
can be partially mitigated by compensation algorithms | [38) , 
but not fully removed. Although negligible at low signal-to- 
noise ratios (SNRs), hardware impairments characterize the 
maximal achievable SE at high SNRs |39) . The vast majority 
of prior works on the design of energy-efficient networks 
assumes ideal transceivers, although hardware impairments 
might have a fundamental impact also on the EE. Motivated 
by this fact, the distortion noise arising from hardware im¬ 
pairments is included in the system under investigation. We 
concentrate on UE impairments, which is expected to be 
the dominating effect since small BS arrays can use high- 
grade hardware and massive arrays are resilient to hardware 
impairments ]40). 


Denote by .Sj t £ C an arbitrary symbol transmitted by UE 
i in cell j and assume that it is normalized to unit power 


(i.e., E{|sji| 2 } = 1). Similar to [381, (39], |4l] , the hardware 


impairments are modeled as a reduction of the desired signal 
power by a factor of 1 — e 2 , with 0 < e < 1, and by adding 
Gaussian distortion noise that carries the removed power. More 


precisely. 




is 


replaced with \/l — e 4 s 






where 


-jl 


CAf(0, e 2 ) is the additive UL distortion noise caused at UE i 
in cell j. In these circumstances, the received signal y 0 £ C M 
at BSq takes the form 


K 


i=1 


K 


yo = ^2 VmWl - e 2 s 0l + e 0 j)hi 


00i 


(3) 


T, y e2s ji + e ji) h 0ji + n 0 

je'h »=i 


where n 0 ~ CAT(0, ct 2 Im) models receiver noise with vari¬ 
ance a 2 . The parameter e is referred to as the level of impair¬ 
ments and is tightly connected to the error vector magnitude 
(EVM), which is a common quality measure of transceivers^] 
The 3GPP LTE standards specify EVM requirements in the 
range [0.05,0.175], where higher-order modulations are sup¬ 
ported if the EVM is below the lower part of this range | |42) . 
Note that the ideal hardware case is obtained by simply setting 
e = 0. 


4 The EVM is usually defined as the ratio between average distortion 
magnitude and signal magnitude, which becomes e/s/l — e 2 ~ e with our 
notation. 


C. Channel Acquisition 

As it is known, coherent processing of the received signal 
yo requires knowledge of the UL channel vectors hoy, for 
i = 1 ,K. For this purpose, we assume that B out of the S 
symbols in each UL block are used for pilot transmission. This 
means that in the whole network there exists B orthogonal 
pilot symbols that are shared among the cells. We assume that 
each BS picks K < B pilot symbols uniformly at random 
in each block, to avoid cumbersome pilot coordination. This 
means that on average K/B of the cells reuse any given pilot 
symbol. We call /? = > 1 the pilot reuse factor such that 

/ 3I\ = B < S. In this setting, the pilot symbol, sent by 
the typical UE and received at the typical BS, is interfered 
with by the subset of cells in which there is another UE (still 
indexed by k without loss of generality) reusing the same pilot 
symbol. This is modeled through a binary stochastic variable 
Xokj £ {0,1}, where \okj = 1 means that UE k in cell j 
uses the same pilot as the typical UE and thus causes pilot 
contamination. This occurs with probability K/B. Similarly, 
Xokj = 0 means that there is currently no pilot contamination 
from cell j, and it occurs with probability 1 — K/B = 1 — 1 /8. 
Therefore, the typical UE k transmits a single pilot symbol 
(e.g., Sofc = 1) and BSo receives the following signal: 

z ok = VPofc(V / l — e 2 + eofc)hoofc 

T ^ ^ Xokj \/ Pjk (\/1 T Gjk)^-0jk T no- ^ ^ 

je** 

We compute the minimum mean-squared error (MMSE) esti¬ 
mate of hoo/. from the observation Zofc. 


Lemma 2. The MMSE estimate of the typical UE’s channel 
to its serving BSq is 


hoofc — 


puid° 


1 + T,j^ x XOkjJrf + ! y 


-Zo k- 


(5) 


The estimation error Ah 0 ofc = h 0 ofc — h 0 ofc is distributed as 
Ahoo/j ~ CA7(0, Coofc), where the estimation error covari¬ 
ance matrix Coofc £ C MxM is given by 


Coo/c — 


cod'. 


1 - 


1 — e 2 


OOfc 


l + Ejevp x X0fey#7 + 


I -M- 


( 6 ) 


Proof: This follows from standard results on MMSE 
estimation; see (43] Chapter 15.8]. ■ 


III. Average Spectral Efficiency 

We assume that MRC processing is used at the BSs for 
data recovery, since MRC is computationally efficient and 
performs well in both small cells (with K = 1) and massive 
MIMO (with M 3> K). In particular, the symbol transmitted 
by the typical UE is detected at BSo by correlating the 
received signal in ([3] with the MMSE estimate hoofc; that is, 
fok = U)OfchQ 0fc yo is the received signal after MRC processing 
with i/oofc £ C being a scaling factor (see Appendix A). 
Unfortunately, the ergodic capacity for a network (such as 
the one under investigation) in which only imperfect CSI 
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M( 1 - e 2 ) 2 


+ E E^ + f)( 1 + 5 s |t + f)+M(i-aU E(fe) 2 + « 2 ) 

je*A»=i 0jI /\ mfc / \ je** v 0jfc/ / 


A' 


M(1 - e 2 ) 2 


( A "+ t)( 1 + ^= 2 ) + t) + ^( 1+ ^) + f + 5=l) + M ( 1 ~ e2 )(^I)+ e2 ) 


( 8 ) 


( 10 ) 


is available at the receivers and the inter-cell interference is 
modeled as a shot-noise process is not known yet [36) . Hence, 
in this work the SE expression is obtained using classical 
achievable lower bounds on the ergodic capacity that are 
particularly common in the massive MIMO literature © 
© ( 33 ). The following result holds for any given realization 
of H/a and the UE locations. 

Lemma 3. With MRC processing and any given realization 
of \ and the UE locations, a lower bound on the ergodic 
capacity [bit/symbol/user] of the typical UE is 

(l-^)log 2 (l + SINR 0fc ) (7) 

where the pre-log factor (l — accounts for pilot overhead 

and SINRofc, in at the top of the page, is the effective SINR. 

Proof: The proof is given in Appendix A and proceeds 
as follows. First, the lower-bounding approach from J44) and 
m is used to obtain the mutual information in the presence 
of imperfect CSI. Then, the statistical properties of the MMSE 
estimator and power-control policy are exploited to compute 
the average with respect to the channels in the presence of 
pilot contamination. ■ 

The expression in Lemma [3] holds for any 3 > 1 such that 
/3I\ < S, since the pilot signals need to be contained in a 
coherence block. Observe also that fBK does not need to be 
an integer since an arbitrary (3K can be achieved by switching 
(for appropriate fractions of time) between the closest smaller 
integer, [(3K\, and the closest larger integer, \j3K~\. 

The average SE per UE can be obtained by taking the 
expectation of 0 with respect to the PPP 'lT\ and the UE 
locations. This would require heavy numerical evaluations of 
integrals. As a key contribution of this paper, in the following 
we provide a tractable and tight lower bound on the average 
SE. 

Proposition 1. If MRC is employed, a lower bound on the UL 
average SE [bit/symbol/user] is 

SE = (9) 

where SINR is given in © at the top of the page. 

Proof: The proof is given in Appendix B and is based on 
Jensen’s inequality, which leads to tractable expressions since 
only moments of the terms in 0 need to be computed. ■ 
The tightness of the SE bound in Proposition [T] is demon¬ 
strated later on by means of simulations (see Fig. 0. The 
numerator of SINR in (|T()| scales with M due to the array 


gain from coherent processing, but the hardware impairments 
cause the multiplicative loss (1 — e 2 ) 2 . The last term in the 
denominator scales with M and also with the sum of 
and e 2 . The former term is called coherent pilot contamination 
since it accounts for interference received from UEs that use 
the same pilot symbol as the typical UE, while the latter is due 
to the distortion noise emitted from the typical UE itself. Many 
of the interference terms in ( flQ| increase with K since having 
more UEs lead to more transmit power per cell. Note that ( fi7j| 
is independent of the BS density A, due to the power-control 
policy in 0. The average transmit power per UE in <[2]) is, 
however, proportional to A _a / 2 so that less power is needed (to 
sustain a fixed SNR per UE) as the network becomes denser. 


IV. Problem Statement and Energy Efficiency 
Optimization 

As described in Section [T| we concentrate on the UL EE 
defined as the benefit-cost ratio between the area spectral 
efficiency (ASE) [bit/symbol/km 2 ] and the area power con¬ 
sumption (APC) [J/symbol/km 2 ]. Using the novel and tight 
lower bound from Proposition [T] in this work the ASE is 
obtained as 

ASE = AA" SE. (11) 


To specify the APC, we begin by observing that with the 
adopted power-control policy the average radiated power per 
UE is 


r(a/2 +1) 

PUJ (ttA)«/ 2 

( 12 ) 

where we have used 0 and the fact that each user transmits 
one pilot symbol and S — BK data symbols per block. Then, 
we observe that the APC must account not only for the radiated 
power, but also for the dissipation in analog hardware, digital 
signal processing, backhaul signaling, and other overhead costs 
(such as cooling and power supply losses). A detailed and 
generic model that takes all these factors into account was 
recently proposed in |23| and is such that (using the same 
notation as in (23) for simplicity) the APC is computed as 


S - (fiK - 1) 
S~ 


E {Pji} = ( 1 - 


BK - 1 


APC = A 


1 - 


/3K — 1 \ put T{a/2 + 1) 


S 


rj (7rA)“/ 2 


K 


+C 0 +C 1 K+V 0 M+V 1 MKj+A- ASE (13) 


where p £ (0,1] is the linear power amplifier efficiency, 
Co models the static power consumption at a BS, and DqM 
models the power consumption of the BS transceiver chains. 
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which scales with the number of BS antennas. Moreover, 
C\K + T) \ M K models the power consumed at the UEs and 
also by the signal processing tasks at the BS, except for the 
coding and decoding that are proportional to the number of 
bits (with proportionality constant A). Notice that ( fl3| > has 
an intuitive polynomial structure in M, K, and p. We refer 
to p3) for further modeling details. The forthcoming analysis 
holds for any positive values of the above parameters, but some 
examples are later given in Table [I] 

The objective of this work is, for any given frame length 
(.S'), propagation parameters (a,ui), and hardware character¬ 
istics (r],e,A,Co,Ci,T>o,T>i), to find the tuple of parameters 
9 = (f3, p, A, K, M) that solves the following constrained EE 
maximization problem: 


maximize 

eee 

subject to 


EE(0) 


ASE(0) 

APC(0) 


SINR = 7 


(14) 


where 0 is the feasible parameter set defined as 


0 = {9 : p > 0, A > 0, /3 > 1, (M, K) e Z+, K/3 < S} 

(15) 

with K/3 < S being the upper limit on the pilot signaling 
overhead. The parameter 7 > 0 in © is used to impose 
an average SE constraint of log 2 (l + 7) [bit/symbol/user], 
where the average is computed with respect to both BS and UE 
locations. This constraint is needed to obtain a network with an 
acceptable throughput, since unconstrained EE maximization 
often leads operating points with very low SEs per UE; this 
is later on illustrated in Fig. [2] where the EE increases as 7 
decreases. 

In the remainder of this section, we analyze the EE max¬ 
imization problem in ( fl4| > to expose fundamental behaviors 
and to develop an algorithm for solving it. 


A. Feasibility 

Due to the unavoidable inter-cell interference in cellular 
networks, the optimization problem ( fl4| is only feasible for 
some values of 7. This feasible range is obtained as follows: 


Lemma 4. The optimization problem ( |14| > is feasible if 
S (a — 1)(1 — e 2 ) 

7 < . 2 or -TT- (16) 

1 + — 1 ) 

Proof: Observe that SINR in ( [Tol l is a monotonically 
increasing function of M. Therefore, an upper limit can be 
computed by letting M —> 00 . This yields 

(1- 


sinr^^ 


e 2)2 




(17) 


which is an increasing function of the optimization variable 
(3. Under the constraint BK < S, the maximal value of 3 is 
S and this value is obtained for K = 1. Putting all these facts 
together and using simple algebra lead to ( [T6l >. ■ 

The above lemma shows that the maximal SINR level is 
limited only by the hardware impairments, through e, and by 
the severity of the pilot contamination, which is determined 
by the pathloss exponent a and the coherence block length S. 


These are the only limiting factors as M — 00 . Assume for 
example that e = 0.05 (39), |40) and consider the relatively 
conservative propagation parameters a = 3 and S = 200. For 
these numbers, the upper limit of the average SE per UE is 
log 2 (l + 199.5) ~ 7.65, which is substantially higher than 
the SE of contemporary systems (42) . This means that the 
optimization problem © is feasible in most cases of practical 
interest. 


B. Optimal Pilot Reuse Factor (3 

We begin by deriving the optimal value of the pilot reuse 
factor (3 when the other optimization variables are fixed. 

Theorem 1. Consider any set of {p. A, M. K } for which 
the problem © is feasible. The SINR constraint in © is 
satisfied by selecting 

Ri7 


r = 


where 

Bi = 


AK 


(a — 2)' 


+ 


M( 1 - e 2 ) 2 - B 27 
K + M( 1-e 2 ) 2 ( k + a~)' 


(18) 


a — 1 


a — 2 


(19) 


b -( a ' + 7 + S )( 1 + 7 ) +(1 -' Vm - 

( 20 ) 

Proof: By gathering the terms that contain f3 and using 
©, the SINR constraint can be rewritten as 

M(1 — e 2 ) 2 


7 = 


( 21 ) 


R1//3 + B2 

with Bi and B 2 given by © and (20) . We then obtain ( p~8j ) 
by solving ( |2Tj ) for (3- ■ 

The above theorem provides insights on how the EE-optimal 
pilot reuse factor /?* depends on the other system parameters. 
Firstly, recall that increasing [3 translates into allocating a 
larger portion of each UL block for pilot transmission, so 
that each pilot symbol is on average only used in 1//3 of the 
cells in the network. This leads to higher channel estimation 
accuracy and less coherent pilot contamination. Secondly, [3 * 
is an increasing function of B\ and also of /i 2 , since a larger 
B -2 makes the denominator smaller. Consequently, Theorem 
[T] shows that to guarantee a certain average SINR, (3* must 
increase with K. This is intuitive since more UEs per cell 
means more inter-cell interference, which can be partially 
suppressed by increasing the estimation accuracy and reducing 
the pilot contamination; namely, using a larger /?. Similarly, 
/?* decreases with p since higher transmit powers reduce 
the detrimental impact of noise, leading to higher estimation 
accuracy and to a more interference-limited regime. Moreover, 
B* is a decreasing function of M since an improved array gain 
makes the system less sensitive to interference and estimation 
errors. Increasing the pathloss exponent a leads to a smaller [3 * 
(since B\ and /i 2 are reduced), which is natural since inter¬ 
cell interference decays more quickly. Thirdly, the fact that 
/3 > 1 implies that we can only achieve values of 7 for which 
M (i-c 2 ) 2 -B 2 'y — 1’ otherwise even (3 = 1 would provide an 
SINR higher than 7. 
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5 M(l-f 2l )2-B 27 ) l°S2(l+7) 


EE(/3*) = 


( 1 + i-f M(i-f^-B 27 )^rg^+Co+Ci^+PoM+P 1 M^+^(l-f M(1 -f^_ B27 )log 2 (l+ 7 ) 


(23) 


^( 1 -f M(l-f^-B 27 ) 1 °g2(l + 7) 


EEoo — 


Co + C X K + V 0 M + V\MK + AI<(1 - f M/d^rg^ ) log 2 (l + 7) 


(25) 


C. Optimal BS Density and Radiated Power 

Next, we optimize the BS density and radiated power. 
Plugging the optimal (3* from Theorem [T] into (|74j), the EE 
maximization problem reduces to 


maximize 
p, A>0, M,Ke z + 

subject to 


EE(/3*) 


B 17 
M( 1 - e 2 ) 2 
_ -BiT 

M(1 - e 2 ) 2 


B 2 7 


> 1 

_ S_ 

B 2 7 “ K 


( 22 ) 


with EE(/)*) given in ( |23) at the top of the page. 

The optimal values for the BS density A and the power- 
control coefficient p are given in the following theorem. 


Theorem 2. Define p = Xp for p > 0 and consider any set 
of {/5, M, A} for which the problem ( |22| ) is feasible. Then, 
EE(/3*) is a monotonically increasing function of A and is 
maximized as A —* oo. The average transmit power per UE 
in <0 will then go to zero. 


Proof: The objective function EE(/3*) in ([23) is a mono¬ 
tonically increasing function of A since the transmit power 
term decreases with A proportionally to p/X a / 2 = p/ A Q/,2_1 . 
The EE is also an increasing function of the factor (1 — 
^ M(i_eTP~_Bo 7 ) an d ^is expression also increases with A 
since B\ and B 2 are decreasing with p = Xp. Therefore, 
EE(/3*) is maximized as A —i oo. ■ 

This theorem proves that from an EE perspective it is 
preferable to have as high BS density as possible. This might 
be unexpected since smaller cells lead to more interfering 
UEs in the vicinity of each cell, but this issue is resolved 
by the assumed power-control policy that gradually reduces 
the transmit power as the BS density increases. The main 
consequence of letting A grow large is thus that the transmit 
power becomes negligible as compared to the circuit power in 
each cell. 

Clearly, an infinitely high BS density is infeasible in prac¬ 
tice. However, the numerical results in Section [V] show that 
the asymptotic limit is almost achieved already at the modest 
density of A = 10 BS/km 2 . There are two factors that restrain 
the practical BS density: 1) the dimensionality of the BS 
equipment that limits the inter-BS distances; and 2) the UE 
density that limits the number of BSs that can serve I\ UEs 
each. In ultra-dense networks, only a subset of all BSs are 
turned on at a given point in time; load balancing can be 
used to make sure that each active BS serves the most energy- 
efficient number of UEs, while the remaining ones are placed 


in sleep mode. This is further analyzed in Section V-C 


D. Optimal Number of Antennas and UEs per BS 

By using Theorem [2] the EE maximization problem in ( |22) 
further reduces to 


maximize EEoo 

M,A'eZ + 


subject to 


_ B17 

M(1 - e 2 ) 2 
_ B17 

M(1 - e 2 ) 2 


B 2 7 


> 1 



B 2 7 K 


(24) 


where EE^ is given in ( |25) at the top of the page and we 
have defined 


Hi = K 


(a — 2) 2 


a — 1 


2 \ M(1 — e 2 ) 

a-2 ) a — 1 ’ 

(26) 


B 2 = K 



+ (1 


e 2 )e 2 M. 


(27) 


To find the optimal values for Ad and K, an integer-relaxed 
version of ( |24) is first considered where M and K can be any 
positive scalars. The integer-valued solutions are then extracted 
from the relaxed problem. For analytic tractability, we replace 
M with c = M/A', which is the number of BS antennas per 
UE. For a given c, the EE-maximizing value of K is found as 
follows. 


Theorem 3. Consider the optimization problem m where 
M and K are relaxed to be real-valued variables. For any 
fixed c = MIK > 0 such that the relaxed problem is feasible 
for some K, the EE is maximized by 


K* = 
where 


\J(GC 0 ) + CqD\c + CqG (Ci + Vqc) — GC 0 


V\C + G (Ci + Vq c) 


G = — 


[ V i 7 I 2 7 \ , 7 (1— e 2 ) - 

1 y (a— 2 ) 2 ' a— 1 ' a-2 J ' a-1 C 


(1 - e 2 ) (1 - (1 + 7)e 2 ) c - (l + 7 


(28) 


(29) 


Proof: If c is given, then we want to maximize (using the 
notation of the theorem and 1Z = log 2 (l +7)) 


A(1 - KG)K 

C 0 + (Ci + V q c)K + VicK 2 + AK(l - KG)K' 


(30) 


It is straightforward to show that © is a quasi-concave 
function of K (e.g., using the approach in the proof of Lemma 
3 in (23|l). The maximizing value I\* is thus obtained by 
taking the first derivative of ( |30) and equating to zero. This 
leads to the expression in (|28). The first constraint in (|24) 




























apQ3 Q 4 

a 5 


(I0CL1CL4 
a 5 


«0 a 3 a 4 

CL2CL5 


a 0 QiQ 3 I Q 0 Q ; 

Cl 2 ^2 


( 31 ) 


Q-i + 03 + \ CL1CI3 a -1 + 


Cli CL2CL4 
a 5 


a 2 - a 0 


is independent of iv and thus fulfilled if the problem was 
feasible. The solution also satisfies the second constraint, 
namely K < 1/G, since the EE in l [30| is negative for 
K > 1/G and thus is not maximized in thus range (and the 
zero objective function for K = 0 would then be higher). ■ 
Similarly, if K is fixed, then the EE-maximizing value of c 
is obtained as follows. 


Theorem 4. Consider the optimization problem \2A\ where 
c = M/K and K are relaxed to be real-valued variables. For 
any fixed I\ > 0 such that the relaxed problem is feasible, 
the EE is maximized by c* in © at the top of the page, 
if this solution satisfies the first inequality constraint in ©• 
Otherwise, the EE is maximized by 


(l_ e 2)(l_(l + 7 ) e 2)_7(^i 

The following parameters are used in © and ( |32[ l: 

yK(l — e 2 ) 

“° = S(a-l) 

K / 47 7 27 \ 

1 5 \ (a — 2) 2 ~*”a—1 a — 2/ 

a 2 = (l-e 2 )(l-(l + 7 )e 2 ) 



(14 — Co + C\K 
a 5 =V 0 I< + V X K 2 . 


(32) 


(33) 

(34) 

(35) 

(36) 

(37) 

(38) 


Proof: Using the notation introduced in the theorem, the 
objective function in ( |24| ) reduces to 

K{ l-gj^)log 2 (l+7) 
a 4 +a 5 c + AK( 1 - log 2 (l + 7 ) 

which can be easily shown to be a quasi-concave function of 
c. The maximizing value c* is thus obtained by taking the first 
derivative of (|39j» and equating to zero. This yields the solution 
in ( [3T| >. Note that c* needs to satisfy the first constraint in ( |24| . 
If this is not the case with ( |3Tj ), then the EE is monotonically 
increasing for all feasible c and the largest value is obtained 
when there is equality in the first constraint, which occurs at 
the value in ( |32| . Finally, we notice that c* satisfies the second 
constraint in ( |24| ) automatically because the maximum cannot 
give a negative EE. ■ 

The above two theorems show how K and c (and also M = 
cK) are related at the EE-optimal points. It turns out that 
K* decreases with c, proportionally to \J 1 /c when c grows 
large. Similarly, M* = c*K is found to increase with K. 
The intuition is that more BS antennas should be deployed to 
control the interference when more UEs are served. In addition 
to this, the results of Theorems [3] and [4] reveal how the power 


consumption coefficients impact K* and M*, respectively. In 
particular, from ( [28] ) it is found that K * increases with the 
static power consumption Co, while it decreases with C\, V 0 , 
and V 1 that are the terms of the APC that increase with K 
and M. Similarly, M* increases with Co and Ci, but decreases 
with T >0 and T>\. The simple intuition behind these scaling 
behaviors is that more hardware should be turned on (i.e., BS 
antennas and UEs) only if the increase in circuit power has a 
marginal effect on the total APC. Similarly, with a larger static 
consumption we can afford to turn on more BS antennas and 
UEs since the relative power cost is lower. 

Using Theorems [3] and [4] we can devise an alternating 
optimization algorithm to solve the integer-relaxed EE maxi¬ 
mization problem: 

1) Find a feasible starting point ( M,K ) to ©; 

2) Optimize K for a fixed M using Theorem [3j 

3) Optimize M for a fixed K using Theorem B] 

4) Repeat 2)-3) until convergence is achieved. 

This algorithm converges since the EE has a finite upper 
bound and the EE increases monotonically in each iteration. 
In fact, it converges to the global optimum of the relaxed 
problem. 

Corollary 1. The integer-relaxed version of the EE max¬ 
imization problem © is quasi-concave. Hence, the alter¬ 
nating algorithm converges to the global optimal solution 
(K**, M**) £ R 2 to the integer-relaxed version of the 
problem. 


Proof: The alternating optimization algorithm is guar¬ 


anteed to converge to a local maximum according to 1 46 


Proposition 4]. Suppose that there are multiple local maxima; 
for example, {M\,Kf) and (M 2 ,/i 2 ). By setting M = 

kIZkI k + and var y in s K > we can search on 

the line between these local maxima. For this choice of M, 
it is straightforward to show that the EE is a quasi-concave 
function of K. Hence, only one of the points can be a true 
local maximum, which is a contradiction to the existence of 
multiple local maxima; thus, the algorithm converges to the 
only local/global optimum of the problem. ■ 

The real-valued solution (**) obtained from the 
alternating optimization algorithm is a good starting point 
for finding the integer-valued global optimum to ( |24| ). In 
particular, the quasi-concavity implies that the integer-solution 
is contained in a convex level set around ( K**,M **). In 
many cases, it is one of the four integer points obtained by 
respectively taking the floor and ceiling of K** and M**. In 
general, the global optimum is obtained by searching through 
all integers in the vicinity of the real-valued solution, keeping 
in mind that the EE is quasi-concave in all directions. 

To summarize, the original problem formulation in ( |T4| has 
been solved through the following steps: i) by selecting /3 to 
satisfy the SINR constraint; ii) by letting A — > 00 (which also 
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TABLE I 

Simulation Parameters 


System Parameter 

Symbol 

Value 

Hardware Parameter 

Symbol 

Value 

Coherence block length 

S 

400 

Coding/decoding/backhaul 

A 

1.15- 10" 9 [J/bit] 

Pathloss exponent 

a 

3.76 

Static power consumption 

Co 

10 W ■ r [J/symbol] 

Propagation loss at 1 km 

i0 

130 dB 

Circuit power per active user 

Ci 

0.1 W • r [J/symbol] 

Power amplifier efficiency 

V 

0.39 

Circuit power per BS antenna 

Vo 

0.2 W • r [J/symbol] 

Level of hardware impairments 

e 

0.05 

Signal processing coefficient 

Vi 

1.56 ■ 10 _1 ° [J/symbol] 

Symbol time 

T 

oTTTTT [s/symbol] 

Noise variance 

a 2 

10 _2 ° [J/symbol] 


makes p —> 0 ); in) by devising an alternating optimization al¬ 
gorithm that provides the real-valued M and K that maximize 
the EE; iv) finally, by searching through the integer points in 
the vicinity of the real-valued solution, and capitalizing on 
the quasi-concavity. In the process of solving ( fl4| . Theorems 
m have also exposed the fundamental interplay among the 
optimization variables and how they depend on the hardware 
characteristics and propagation parameters. 

V. Numerical Examples 

Numerical results are used in this section to validate the 
average SE expression provided in Proposition [T] and the 
fundamental results established in Section El The results 
are obtained for the parameter setting summarized in Table 
|T] where we assume for instance a coherence block length 
of S = 400 (e.g., T c = 4 ms and W c = 100 kHz) and a 
bandwidth of 20 MHz (giving the symbol time 1/(2 • 10') s). 
The hardware parameters are inspired by a variety of prior 
works; for example, (23) and references therein. The simula¬ 
tions were performed using Matlab and the code is available 
for download at https://github.com/emilbjomson/maximal-EE, 
which enables reproducibility as well as simple testing of other 
parameter values. 

A. Optimizing the Energy Efficiency 

We begin by recalling that Theorem [2] proves that the EE 
maximization problem in ( f]~4] ) is solved when the BS density 
is infinitely large; that is, A -A oo. We now illustrate how large 
A needs to be for applying this asymptotic result in practice. 
To this end. Fig. [2] shows the EE as a function of A, and 
the other optimization variables are optimized numerically for 
each given value of A. Three different SINR constraints are 
considered in Fig. [2] 7 £ {1, 3, 7} which corresponds to the 
average SEs log 2 (l+ 7 ) € {1, 2, 3}. In all three cases, the EE 
is computed using both the lower bound on the average SE in 
Proposition [T] and an upper bound obtained by averaging over 
the instantaneous SE derived in Lemma [3] using Monte Carlo 
simulations^ 

Several important observations can be made from the results 
presented in Fig. [2] Firstly, there is only a small gap (that 
reduces as 7 takes larger values) between the lower and upper 
bounds, and the curves behave exactly the same for any value 
of 7 . This validates the accuracy of the SE expression provided 

5 This is an upper bound since we only consider an average of 1000 closest 
interfering BSs, while the exact result requires an infinite number of interferers 
in R 2 . 



Fig. 2. Energy efficiency (in Mbit/J) as a function of the BS density 
in BS/km 2 , for different SINR constraints. Results are obtained by both 
computing upper bounds from Monte-Carlo simulations and using the closed- 
form lower bound from PropositioijT] 

in Proposition |T] Secondly, the EE can be greatly improved 
by increasing the BS density, meaning that small cells are a 
promising solution for maximal EE deployment. However, the 
gain from increasing the BS density saturates in the interval 
from A = 10 to A = 100 BS/km 2 , which roughly corresponds 
to an average inter-BS distance of 100-315 meters. This is 
not small compared to contemporary urban deployments. In 
other words, EE maximization based on letting A —> 00 is 
expected to give representative results in most practical dense 
deployments. Thirdly, we note that the EE decreases as 7 
increases, which is why it is important to specify a target 
SINR; otherwise the EE maximizing operating point might 
be very spectrally inefficient and therefore useless from a 
practical user service perspective. 

We proceed by further studying the impact of varying M 
and K. To this end, we let A -A 00 as prescribed by Theorem 
[2] and fix the SINR constraint at 7 = 3 or, equivalently, the 
average SE per data symbol at log 2 (l + 7 ) =2. Fig. [ 3 ] shows 
the EE lower bound as a function of M and K when 3 is 
optimized according to (18) . The global EE maximum gives an 
EE of 10.156 Mbit/J and is achieved by (M*, K *) = (91,10) 
using the pilot reuse factor /3* = 7.08. Interestingly, this is 
a configuration that falls within the class of massive MIMO 
setups (Tg, (TT), (14). The intuition behind this result is 
that the strong inter-cell interference in dense deployments 
is efficiently mitigated by MRC when the BSs are equipped 
with many antennas and also by using a substantial pilot reuse 
factor (to protect against channel estimation errors and pilot 
contamination). 
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Number of UEs (K) 


Global Optimum: 
M = 91, K = 10 
EE = 10.10 Mbit/J 


100 


50 

Number of 
BS antennas (M) 


Fig. 3. Energy efficiency (in Mbit/J) for 7 = 3. The global optimum is 
star-marked, while the convergence of the alternating algorithm from Section 
IIY-Dl is indicated with circles. 


■4ASE 



Fig. 4. Relative size of each term in the APC of at the global optimum 
from Fig. [3] 


The results obtained with the alternating optimization al¬ 
gorithm from Section IV-D| are also shown in Fig. [3] The 
initialization point was set to ( M,K ) = (20,1). As seen, the 
algorithm converges after three iterations to the real-valued 
solution (M**, K**) = (91.6,10.1) with an EE of 10.157 
Mbit/J. This real-valued operating point gives only a 0.009% 
higher EE than the integer-valued solution, thus showing that 
the EE performance is quite flat around the global optimum. 


To study the global optimum even further, the pie diagram in 
Fig. [4] shows the relative size of each term in the APC of ( fl3| ) 
(the transmit power is not shown since it takes negligible val¬ 
ues). The dominating terms are the static power consumption 
Co and the consumption T>qM of the BS transceiver chains. 
These seem to be the main factors to improve in order to make 
the hardware more energy-efficient. Observe that the above 
results might be different for other values of Co, Ci, Vq, and 

2?i. 



Fig. 5. Energy efficiency (in Mbit/J) as a function of the level of transceiver 
hardware impairments, e, for different SINR constraints. 


B. Impact of Transceiver Hardware Impairments 

Next, we consider the saturation regime where A — > oo and 
exemplify the impact of transceiver hardware impairments on 
the EE. Fig. [5] shows the EE as a function of e, the level of 
hardware impairments. As expected, the EE decreases with e 
since the desired signal power in ( [I()| decays as (1 —e 2 ) 2 . The 
loss is marginal for 7 = 1, but it can be relatively large when 
7 increases. This is in line with the previous results in J39) , 
wherein it is shown that hardware impairments greatly affect 
the channel capacity in the high SNR regime (i.e., for large 
7 ) while their impact is negligible in the low SNR regime. 
The results of Fig. [5] indicate that for the investigated network 
the EE loss due to hardware impairments is negligible for 
e < 0.1 when 7 £ {1, 3}. Since these values correspond to 
the operating points that give the highest EE (see Fig. [2}, 
we may conclude that modest levels of hardware impairments 
have a negligible impact on networks designed for high EE. 


C. EE Maximization for a Given UE Density 

So far, the BSs have been deployed to match an unlimited 
heterogeneous UE distribution. In particular, we have assumed 
that each BS serves K UEs such that a BS density of A 
BS/krn 2 corresponds to an average of K A UEs per km 2 . 
Hence, as the BS density grows large we also let the average 
UE density grow large. To validate the plausibility of this 
model, suppose that we instead deploy the BSs to match a 
fixed average UE density of p UE/km 2 . Mathematically, this 
amounts to solving ( fl4| ) with the additional constraint 

p = KX. (40) 

We will study how such an extra constraint affects the results, 
taking into account that future average UE densities from p = 
10 2 UE/km 2 (in rural areas) to p = 10 5 UE/km 2 (in shopping 
malls) have been predicted in the METIS project |47) . 

Fig. 0 shows the EE has a function of the UE density p 
for the average SINR level 7 = 3 , while Fig. [7] shows the 
corresponding BS density. The design parameters M, K, j3 , 
A, and p are optimized numerically according to ( fl4| , with 
the additional constraint Q. Two reference cases are also 
considered: single-user single-input multiple-output (SIMO) 
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Fig. 6. Energy efficiency (in Mbit/J) vs. the UE density fi. The EE is 
optimized according to ED with the extra constraint p = K A, or only with 
respect to (A,/3,p) for given M and K. 



Fig. 7. Optimized BS density (in BS/km 2 ) vs. the UE density (in UE/km 2 ). 
The system is optimized in the same way as in Fig. [6] 


transmission with (M,K) = (10,1); and massive multi-user 
MIMO transmission with ( M,K) = (91,10), which was 
shown in Fig. [3] to be optimal as A —> oo. Clearly, only /3, 
A, and p are optimized in the two reference cases. 

Several important observations can be made from the results 
presented in Figs. [6] and [7] Firstly, the EE level becomes 
independent of the UE density when p is large enough; the 
saturation occurs for p > 100 in the optimal and the fixed 
massive MIMO cases, while it occurs for p > 2 in the single- 
user SIMO case. The future UE density range p £ [10 2 ,10 5 ] 
predicted in 63 is fully covered in these saturation regimes. 
In all practical relevant cases we can, according to Fig. [7] 
simply scale the BS density linearly with the UE density, 
while using the same number of BS antennas and UEs per 
BS. Similarly, one can turn on and off BSs to account for 
variations in the UE load. Secondly, the fixed massive MIMO 
configuration ( M,K ) = (91,10) achieves the maximal EE in 
the saturation regime, while it is inefficient as compared to 
single-user MISO for low UE densities (e.g., for p < 8). In 
contrast, single-user SIMO transmission performs reasonably 
well at low UE densities, but saturates earlier and at an 
EE level that is 3.14x lower than the maximal EE. More 
importantly. Fig. [7] shows that the single-user case requires a 
10x higher BS density in the saturation regime, which might 


greatly reduce the deployment cost. 

In summary, the transmission power appears to be negli¬ 
gible, as compared to the circuit power, for networks that 
can handle future UE densities. By adding massive multi-user 
MIMO capability to the BSs, the EE can be increased by a few 
hundred percentages, and the even more substantial benefit is 
that it can reduce the BS density with an order of magnitude; 
this is likely to be a key property to achieve a cost and energy 
efficient network densihcation. 


VI. Conclusion 


Network densihcation is the key for achieving high EE in 
cellular networks, and can be realized by either having many 
BSs or many antennas per BS. In order to find the optimal 
densihed network configuration, we formulated an UL EE 
maximization problem under the assumption of a stochastic 
BS deployment based on Poisson point processes. By deriving 
a new lower bound on the average SE in the network and 
using a state-of-the-art power consumption model, the EE 
expression became tractable and was maximized analytically 
with respect to the density of BSs, the transmit power levels, 
the number of BS antennas and users per cell, and the pilot 
reuse factor. The closed-form expressions provided general 
guidelines on the optimal operating regimes and exposed 
the fundamental interplay among the optimization variables, 
hardware characteristics, and propagation environment. The 
analysis focused on the UL, but similar results are expected 
in the DL due to the UL/DL duality concept. 

The analysis shows that reducing the cell size is undoubtable 
the way towards high EE, but the positive effect of increasing 
the BS density saturates when the circuit power dominates over 
the transmission power. A further leap in EE can typically be 
achieved by adding extra BS antennas to multiplex several 
UEs per cell; the numerical examples resulted in an EE 
maximum with 91 BS antennas and 10 UEs, which is a 
massive MIMO setup. The EE gains come from suppressing 
intra-cell interference and by sharing the circuit power costs 
among multiple UEs. We stress that a hundred antennas can 
be deployed also at a small-size BS, since we consider simple 
dipoles and possibly higher frequencies than in contemporary 
networks. Moreover, the analysis shows that a large pilot reuse 
factor can protect against inter-cell interference and can be 
tailored to guarantee a certain average SE. 

While this paper focused on the ratio between spectral 
efficiency and energy costs, it is straightforward to also in¬ 
clude the deployment cost, site renting, and other economical 
factors; the annual depreciation expense can be turned into an 
energy-equivalent form (by multiplying with the energy cost 
in J/Euro) and incorporated in the coefficients Co, Ci, and Ai 
in m This is likely to mainly increase the constant term Co, 
thus pushing the optimal operating point towards having fewer 
BSs, with more antennas and UEs per cell. 
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Appendix A: Proof of Lemma[3] 

Let voo/c = ^oofchooA; denote the MRC detector. Then, the 
received signal is given by 

K 

VoOfcYO = ^2 \/P0i(V / l e 2 S 0 z + e o0 v OOA;hoO2 
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(41) 

The ergodic capacity of this channel is defined as the maximal 
mutual information, which unfortunately cannot be computed 
exactly under imperfect CSI. However, we can compute a 
rigorous lower bound using the well-established bounding 
technique from p4) and (45}. More precisely, the first term 
in ( pH) is treated as the only desired signal while the other 
terms are uncorrelated with the first one and treated as worst- 
case Gaussian noise in the detection (a worst-case assumption). 
This leads to the mutual information log 2 (l + SINRofc) where 
SINRo/c is the ratio between the desired power and the “noise” 
power: 

SINR ofc =p 0 fc(l - e 2 )|E{vo 0fe h 00 fe}| 2 
/(p ofc (l - e 2 ) (E{|vg 0fc h 0 ofc| 2 } - |E{vQ 0fc h 00 fc}| 2 ) 

+ Po fc e 2 E{|v£ ofc hoofc| 2 } + y^p oi E{|v£ ofc hooi| 2 } 
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+ E E^i iE 'Ll v oofc h OR| 2 }+ E {|vo 0 fcno| 2 }) (42) 

jG^x(=1 

where we have used the fact that E{ | \/1 — e 2 Sji -p | 2 } = 
1 — e 2 + e 2 = 1. To proceed further with the computation of 
the expectations in ( f42| ), we define 

Tok = 1+ E Xoki^~ + (43) 

Zg*, Olk P 


and notice that the MMSE estimate of ho,/,, (for j = 0 or 
j £ suc h that Xokj = 1) takes the form 
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whereas the independent estimation error is distributed as 
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For normalization purposes, the scaling factor z/oofc is selected 
as _ 
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which follows by using ( |46| and the orthogonality principle of 
MMSE estimators as well as the adopted power-control policy 
in 0- Since the noise is independent of the channels, we also 
have 
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(48) 


Next, we consider the terms in ( |42| that are affected by pilot 
contamination. For convenience, we let Xoko = 1 since the 
typical UE uses the same pilot as itself. Then, for all users 
i in cells j that are not causing pilot contamination (i.e., for 
i Z k or for i = k and xokj Z 1), we have 
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where we have taken into account that hoofc is independent 
of iii , and the last equality follows from averaging over the 
stochastic pilot selection (with E{xofc;} = 1 //? by design). 

For all users i in cells j such that i = k and \okj = 1, 
the channel estimate h 00 fc is correlated with h ny , due to pilot 
contamination and thus we obtain 
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where the first equality follows from the expansion ho y /,- = 

h 0 j k + Ah 0jk and the fact that h(y fc = J dji %t 00k hoofc- The 

y Oj k 

second equality follows from using ]48| Lemma 2] and ( 1461 . 
whereas the third equality simplifies the expression taking into 
account that Ejxofcz} = 1//3. Since the second term in ( [50] ) is 
equal to ( [49] ), it follows that it is only the first term in ( [50| that 
comes from pilot contamination and its probability to appear 
is Ejxofei} = 1//3- Plugging (|47|)—([50]) into © the result in 
the lemma follows. 


the terms. To compute both, we let k = 1 or k 2 and notice 
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(53) 


where (a) follows from computing the average number of BSs 
in the circular area and by letting j be an arbitrary BS index 
in 3k a and k be an arbitrary UE index (since each term in 
the sum has the same marginal distribution). Next, (6) divides 
the expectation into one conditional expectation where djjk 
is given and one outer expectation with respect to djjk■ Step 
(c) and (d) compute the inner expectation by utilizing that 
the BSs of other cells are uniformly distributed in the circle 
of radius r at distances larger than djjk . The limit r — > oo in 
(e) is taken both inside and outside the expectation, which is 
allowed since the dominated convergence theorem is satisfied]^] 
The final expression is obtained by computing E{d 2 jfc } as 
in ( [52] ) and exploiting the typicality, which implies that djjk 
and duo/;; have the same marginal distribution. Using similar 
techniques as in ([53]) yields 


Appendix B: Proof of PropositionQ] 


The goal is to compute a tractable lower bound on the 
average SE (1— ^)E{log 2 (l + SINR 0 fc)}, where the expecta¬ 
tion is taken with respect to the PPP and UE locations. To 
this end, the Jensen’s inequality is applied as E{log 2 (l + 
SINR^ 1 )} > log 2 (l + E { SINR - fc i}) to move the expectation 
inside the logarithm. The expectation of the inverse SINR is 
then expanded as in ( |5Tj ) at the top of the page. 

To proceed further, we focus on the BSs in a circular area of 
finite radius r and consider wrap around in the radial domain 
to keep the translation invariance. Note that by letting r —> oo 
a PPP in the whole of R 2 is obtained, and there is no bias since 
the intra-cell distances are assumed to be distributed exactly as 
in R 2 . The reason for doing this is that the average number of 
interfering BSs in the limited circular area is finite and given 
by 7r(r 2 — E{dQ 0fc }) = ir (r 2 — \), where we have used the 
fact that dook ~ Rayleigh ( jy==) (see Lemma IT1) to obtain 


EWoJ = 


r(i//2 +1) 
(ttA)"/ 2 


(52) 


E 


Iee; 


= K- 


I j e * A i=i oji 

The only remaining term in (|5l) is 
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where the equality follows from computing the expectations 
separately over sets that contain different BS indices. The 
first term in (| 5 5 [> is computed as 


for any V > 2. Two of the expectations in ( |5 1 1 ) contain 6 The dominated convergence theorem can be applied since < 

summations over the PPP ’T but with different powers of 1. 
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by first computing the expectation with respect to the number 
of terms in the PPP for a finite radius r and exploiting that 
j / l- By letting r —> oo we arrived at an expression that is 
basically twice the expression previously given in ( f53] ). The 
second term in (|55T> is bounded as 
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(57) 


where (a) computes the expectation with respect to the number 
of terms in the PPP for a finite r, ( b ) follows from Holder’s 
inequality, and (c) exploits that the two expectations are equal. 
At this point, the expression is basically the same as the 
second expression in ( |53) l, thus (d) follows from the same 
steps as those taken in ( |53| . Plugging ( |56| ) and ( [57] ) into ( |55j ) 
we eventually obtain 
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(58) 
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We have now computed all the expectations in m exactly or 
as upper bounds. Using all these expressions eventually lead 
to the achievable lower bound in (flQ|. 
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